Extracting speaker-specific functional expressions from political speeches using random forests in order to investigate speakers’ individual political styles
نویسنده
چکیده
In this study we extracted speaker-specific functional expressions from political speeches using random forests in order to investigate speakers’ individual political styles. Along with methodological development, stylistics has expanded its scope into new areas of application such as authorship profiling and sentiment analysis in addition to conventional areas such as authorship attribution and genre-based text classification. Among these, computational sociolinguistics, which aims at providing a systematic and solid basis for sociolinguistic analysis using machine learning and linguistically-motivated features, is a potentially important area. In this study we show the effectiveness of the random forests classifier for such tasks by applying it to Japanese prime ministers’ Diet speeches. The results demonstrate that our method successfully extracted the speaker-specific expressions of two Japanese prime ministers, and enabled us to investigate their individual political styles in a systematic manner. The method can be applied to sociolinguistic analysis of various other types of texts, and in this way, this study will contribute to developing the area of computational sociolinguistics in the field of stylistics.
منابع مشابه
Extracting author-specifi c expressions using random forest for use in the sociolinguistic analysis of political speeches
This study applies stylistic text classifi cation using random forest to extract author-specifi c expressions for use in the sociolinguistic analysis of political speeches. In the fi eld of politics, the style of political leaders’ speeches, as well as their content, has attracted growing attention in both English (Ahren, 2005) and Japanese (Azuma, 2006; Suzuki and Kageura, 2006). One of the ma...
متن کامل(Un)Translatability of Persian Idiomatic Expressions to English in Political Discourse
The present study sought to investigate the extent to which Persian idiomatic expressions would influence the western translators' strategies in providing the ultimate product in English, and it also attempted to uncover the underlying assumptions in target text, then to suggest some weighty strategies to overcome difficulties with translation. For this purpose, the data was analyzed within the...
متن کاملPolitical sentiment analysis: Predicting speaker attitude in the UK House of Commons
In this paper the authors seek to establish the most appropriate mechanism for conducting sentiment analysis with respect to political debates so as to predict their outcome. To this end two alternative approaches are considered, the classification based approach and the lexicon based approach. In the context of the second approach either generic or domain specific lexicons may be adopted, both...
متن کاملMachine learning and sentiment analysis approaches for the analysis of Parliamentary debates
In this thesis the author seeks to establish the most appropriate mechanism for conducting sentiment analysis with respect to political debates; firstly so as to predict their outcome and secondly to support a mechanism to provide for the visualisation of such debates in the context of further analysis. To this end two alternative approaches are considered, a classification-based approach and a...
متن کاملA Critical Study of Selected Political Elites' Discourse in English
This study explored how political elites can contribute to power enactment through using language. It started with a theoretical overview of Critical Discourse Analysis (CDA), and then presented a corpus consisting of speeches of eight political elites, namely, Malcolm X, Noam Chomsky, Martin Luther King, Josef Stalin, Vladimir Lenin, Winston Churchill, J.F. Kennedy and Adolph Hitler. This stud...
متن کامل